SEQUENCE LISTING



(1) GENERAL INFORMATION: 

(i) APPLICANT: Rodier, Patricia M. 

Ingram, Jennifer L. 
Figlewicz, Denise A. 
Hyman, Susan L. 
Stodgell, Christopher J. 

(ii) TITLE OF INVENTION: GENETIC POLYMORPHISMS WHICH ARE ASSOCIATED WITH AUTISM SPECTRUM DISORDERS

(iii) NUMBER OF SEQUENCES: 10 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Nixon, Hargrave, Devans & Doyle LLP 

(B) STREET: Clinton Square, P.O. Box 1051 

(C) CITY: Rochester 



(D) STATE: New York

(E) COUNTRY: U.S.A.

(F) ZIP: 14603-1051

(v) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: Floppy disk

(B) COMPUTER: IBM PC compatible

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30

(vi) CURRENT APPLICATION DATA:

(A) APPLICATION NUMBER:

(B) FILING DATE:

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 60/049,803 

(B) FILING DATE: 17-JUN-1997 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Goldman, Michael L. 

(B) REGISTRATION NUMBER: 30,727 

(C) REFERENCE/DOCKET NUMBER: 176/60181 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (716) 263-1304 

(B) TELEFAX: (716) 263-1600 



(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1008 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:
ATGGACAATG CAAGAATGAA CTCCTTCCTG GAATACCCCA TACTTAGCAG TGGCGACTCG 
GGGACCTGCT CAGCCCGAGC CTACCCCTCG GACCATAGGA TTACAACTTT CCAGTCGTGC 
GCGGTCAGCG CCAACAGTTG CGGCGGCGAC GACCGCTTCC TAGTGGGCAG GGGGGTGCAG 
ATCGGTTCGC CCCACCACCA CCACCACCAC CACCATCACC ACCCCCAGCC GGCTACCTAC 
CAGACTTCCG GGAACCTGGG GGTGTCCTAC TCCCACTCAA GTTGTGGTCC AAGCTATGGC 
TCACAGAACT TCAGTGCGCC TTACAGCCCC TACGCGTTAA ATCAGGAAGC AGACGTAAGT 
GGTGGGTACC CCCAGTGCGC TCCCGCTGTT TACTCTGGAA ATCTCTCATC TCCCATGGTC 
CAGCATCACC ACCACCACCA GGGTTATGCT GGGGGCGCGG TGGGCTCGCC TCAATACATT 
CACCACTCAT ATGGACAGGA GCACCAGAGC CTGGCCCTGG CTACGTATAA TAACTCCTTG 
TCCCCTCTCC ACGCCAGCCA CCAAGAAGCC TGTCGCTCCC CCGCATCGGA GACATCTTCT 
CCAGCGCAGA CTTTTGACTG GATGAAAGTC AAAAGAAACC CTCCCAAAAC AGGGAAAGTT 
GGAGAGTACG GCTACCTGGG TCAACCCAAC GCGGTGCGCA CCAACTTCAC TACCAAGCAG 
CTCACGGAAC TGGAGAAGGA GTTCCACTTC AACAAGTACC TGACGCGCGC CCGCAGGGTG 
GAGATCGCTG CATCCCTGCA GCTCAACGAG ACCCAAGTGA AGATCTGGTT CCAGAACCGC 
CGAATGAAGC AAAAGAAACG TGAGAAGGAG GGTCTCTTGC CCATCTCTCC GGCCACCCCG 
CCAGGAAACG ACGAGAAGGC CGAGGAATCC TCAGAGAAGT CCAGCTCTTC GCCCTGCGTT 
CCTTCCCCGG GGTCTTCTAC CTCAGACACT CTGACTACCT CCCACTGA 

(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 335 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Met Asp Asn Ala Arg Met Asn Ser Phe Leu Glu Tyr Pro Ile Leu Ser
1 5 10 15

Ser Gly Asp Ser Gly Thr Cys Ser Ala Arg Ala Tyr Pro Ser Asp His
20 25 30

Arg Ile Thr Thr Phe Gln Ser Cys Ala Val Ser Ala Asn Ser Cys Gly
35 40 45

Gly Asp Asp Arg Phe Leu Val Gly Arg Gly Val Gln Ile Gly Ser Pro
50 55 60

His His His His His His His His His His Pro Gln Pro Ala Thr Tyr
65 70 75 80

Gln Thr Ser Gly Asn Leu Gly Val Ser Tyr Ser His Ser Ser Cys Gly
85 90 95

Pro Ser Tyr Gly Ser Gln Asn Phe Ser Ala Pro Tyr Ser Pro Tyr Ala
100 105 110

Leu Asn Gln Glu Ala Asp Val Ser Gly Gly Tyr Pro Gln Cys Ala Pro
115 120 125

Ala Val Tyr Ser Gly Asn Leu Ser Ser Pro Met Val Gln His His His
130 135 140

His His Gln Gly Tyr Ala Gly Gly Ala Val Gly Ser Pro Gln Tyr Ile
145 150 155 160

His His Ser Tyr Gly Gln Glu His Gln Ser Leu Ala Leu Ala Thr Tyr
165 170 175

Asn Asn Ser Leu Ser Pro Leu His Ala Ser His Gln Glu Ala Cys Arg
180 185 190

Ser Pro Ala Ser Glu Thr Ser Ser Pro Ala Gln Thr Phe Asp Trp Met
195 200 205

Lys Val Lys Arg Asn Pro Pro Lys Thr Gly Lys Val Gly Glu Tyr Gly
210 215 220

Tyr Leu Gly Gln Pro Asn Ala Val Arg Thr Asn Phe Thr Thr Lys Gln
225 230 235 240

Leu Thr Glu Leu Glu Lys Glu Phe His Phe Asn Lys Tyr Leu Thr Arg
245 250 255

Ala Arg Arg Val Glu Ile Ala Ala Ser Leu Gln Leu Asn Glu Thr Gln
260 265 270

Val Lys Ile Trp Phe Gln Asn Arg Arg Met Lys Gln Lys Lys Arg Glu
275 280 285

Lys Glu Gly Leu Leu Pro Ile Ser Pro Ala Thr Pro Pro Gly Asn Asp
290 295 300

Glu Lys Ala Glu Glu Ser Ser Glu Lys Ser Ser Ser Ser Pro Cys Val
305 310 315 320

Pro Ser Pro Gly Ser Ser Thr Ser Asp Thr Leu Thr Thr Ser His
325 330 335

(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1008 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:
ATGGACAATG CAAGAATGAA CTCCTTCCTG GAATACCCCA TACTTAGCAG TGGCGACTCG 
GGGACCTGCT CAGCCCGAGC CTACCCCTCG GACCATAGGA TTACAACTTT CCAGTCGTGC 
GCGGTCAGCG CCAACAGTTG CGGCGGCGAC GACCGCTTCC TAGTGGGCAG GGGGGTGCAG 
ATCGGTTCGC CCCACCACCA CCACCACCAC CACCATCGCC ACCCCCAGCC GGCTACCTAC 
CAGACTTCCG GGAACCTGGG GGTGTCCTAC TCCCACTCAA GTTGTGGTCC AAGCTATGGC 
TCACAGAACT TCAGTGCGCC TTACAGCCCC TACGCGTTAA ATCAGGAAGC AGACGTAAGT 
GGTGGGTACC CCCAGTGCGC TCCCGCTGTT TACTCTGGAA ATCTCTCATC TCCCATGGTC 
CAGCATCACC ACCACCACCA GGGTTATGCT GGGGGCGCGG TGGGCTCGCC TCAATACATT 
CACCACTCAT ATGGACAGGA GCACCAGAGC CTGGCCCTGG CTACGTATAA TAACTCCTTG
TCCCCTCTCC ACGCCAGCCA CCAAGAAGCC TGTCGCTCCC CCGCATCGGA GACATCTTCT 
CCAGCGCAGA CTTTTGACTG GATGAAAGTC AAAAGAAACC CTCCCAAAAC AGGGAAAGTT 
GGAGAGTACG GCTACCTGGG TCAACCCAAC GCGGTGCGCA CCAACTTCAC TACCAAGCAG 
CTCACGGAAC TGGAGAAGGA GTTCCACTTC AACAAGTACC TGACGCGCGC CCGCAGGGTG 
GAGATCGCTG CATCCCTGCA GCTCAACGAG ACCCAAGTGA AGATCTGGTT CCAGAACCGC
CGAATGAAGC AAAAGAAACG TGAGAAGGAG GGTCTCTTGC CCATCTCTCC GGCCACCCCG 
CCAGGAAACG ACGAGAAGGC CGAGGAATCC TCAGAGAAGT CCAGCTCTTC GCCCTGCGTT 
CCTTCCCCGG GGTCTTCTAC CTCAGACACT CTGACTACCT CCCACTGA 

(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 335 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Met Asp Asn Ala Arg Met Asn Ser Phe Leu Glu Tyr Pro Ile Leu Ser
1 5 10 15

Ser Gly Asp Ser Gly Thr Cys Ser Ala Arg Ala Tyr Pro Ser Asp His
20 25 30

Arg Ile Thr Thr Phe Gln Ser Cys Ala Val Ser Ala Asn Ser Cys Gly
35 40 45

Gly Asp Asp Arg Phe Leu Val Gly Arg Gly Val Gln Ile Gly Ser Pro
50 55 60

His His His His His His His His Arg His Pro Gln Pro Ala Thr Tyr
65 70 75 80

Gln Thr Ser Gly Asn Leu Gly Val Ser Tyr Ser His Ser Ser Cys Gly
85 90 95

Pro Ser Tyr Gly Ser Gln Asn Phe Ser Ala Pro Tyr Ser Pro Tyr Ala
100 105 110

Leu Asn Gln Glu Ala Asp Val Ser Gly Gly Tyr Pro Gln Cys Ala Pro
115 120 125

Ala Val Tyr Ser Gly Asn Leu Ser Ser Pro Met Val Gln His His His
130 135 140

His His Gln Gly Tyr Ala Gly Gly Ala Val Gly Ser Pro Gln Tyr Ile
145 150 155 160

His His Ser Tyr Gly Gln Glu His Gln Ser Leu Ala Leu Ala Thr Tyr
165 170 175

Asn Asn Ser Leu Ser Pro Leu His Ala Ser His Gln Glu Ala Cys Arg
180 185 190

Ser Pro Ala Ser Glu Thr Ser Ser Pro Ala Gln Thr Phe Asp Trp Met
195 200 205

Lys Val Lys Arg Asn Pro Pro Lys Thr Gly Lys Val Gly Glu Tyr Gly
210 215 220

Tyr Leu Gly Gln Pro Asn Ala Val Arg Thr Asn Phe Thr Thr Lys Gln
225 230 235 240

Leu Thr Glu Leu Glu Lys Glu Phe His Phe Asn Lys Tyr Leu Thr Arg
245 250 255

Ala Arg Arg Val Glu Ile Ala Ala Ser Leu Gln Leu Asn Glu Thr Gln
260 265 270

Val Lys Ile Trp Phe Gln Asn Arg Arg Met Lys Gln Lys Lys Arg Glu
275 280 285

Lys Glu Gly Leu Leu Pro Ile Ser Pro Ala Thr Pro Pro Gly Asn Asp
290 295 300

Glu Lys Ala Glu Glu Ser Ser Glu Lys Ser Ser Ser Ser Pro Cys Val
305 310 315 320

Pro Ser Pro Gly Ser Ser Thr Ser Asp Thr Leu Thr Thr Ser His
325 330 335

(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1021 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

TGACGCATGG ACTATAATAG GATGAACTCC TTCTTAGAGT ACCCACTCTG TAACCGGGGA 60

CCCAGCGCCT ACAGCGCCCA CAGCGCCCCA ACCTCCTTTC CCCCAAGCTC GGCTCAGGCG 120

GTTGACAGCT ATGCAAGCGA GGGCCGCTAC GGTGGGGGGC TGTCCAGCCC TGCGTTTCAG 180

CAGAACTCCG GCTATCCCGC CCAGCAGCCG CCTTCGACCC TGGGGGTGCC CTTCCCCAGC 240

TCCGCGCCCT CGGGGTATGC TCCTGCCGCC TGCAGCCCCA GCTACGGGCC TTCTCAGTAC 300

TACCCTCTGG GTCAATCAGA AGGAGACGGA GGCTATTTTC ATCCCTCGAG CTACGGGGCC 360

CAGCTAGGGG GCTTGTCCGA TGGCTACGGA GCAGGTGGAG CCGGTCCGGG GCCATATCCT 420

CCGCAGCATC CCCCTTATGG GAACGAGCAG ACCGCGAGCT TTGCACCGGC CTATGCTGAT 480

CTCCTCTCCG AGGACAAGGA AACACCCTGC CCTTCAGAAC CTAACACCCC CACGGCCCGG 540

ACCTTCGACT GGATGAAGGT TAAGAGAAAC CCACCCAAGA CAGCGAAGGT GTCAGAGCCA 600

GGCCTGGGCT CGCCCAGTGG CCTCCGCACC AACTTCACCA CAAGGCAGCT GACAGAACTG 660

GAAAAGGAGT TCCATTTCAA CAAGTACCTG AGCCGGGCCC GGAGGGTGGA GATTGCCGCC 720

ACCCTGGAGC TCAATGAAAC ACAGGTCAAG ATTTGGTTCC AGAACCGACG AATGAAGCAG 780

AAGAAGCGCG AGCGAGAGGG AGGTCGGGTC CCCCCAGCCC CACCAGGCTG CCCCAAGGAG 840

GCAGCTGGAG ATGCCTCAGA CCAGTCGACA TGCACCTCCC CGGAAGCCTC ACCCAGCTCT 900

GTCACCTCCT GAACTGAACC TAGCCACCAA TGGGGCTTCC AGGCACTGGA GCGCCCCAGT 960

CCAGCCCTAT CCCAGGCTCT CCCAACCCAG GCCTGGCTTC ACTGCCTGGG ATCTCTAGGC 1020

T 1021

(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 301 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS:

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

Met Asp Tyr Asn Arg Met Asn Ser Phe Leu Glu Tyr Pro Leu Cys Asn
1 5 10 15

Arg Gly Pro Ser Ala Tyr Ser Ala His Ser Ala Pro Thr Ser Phe Pro
20 25 30

Pro Ser Ser Ala Gln Ala Val Asp Ser Tyr Ala Ser Glu Gly Arg Tyr
35 40 45

Gly Gly Gly Leu Ser Ser Pro Ala Phe Gln Gln Asn Ser Gly Tyr Pro
50 55 60

Ala Gln Gln Pro Pro Ser Thr Leu Gly Val Pro Phe Pro Ser Ser Ala
65 70 75 80

Pro Ser Gly Tyr Ala Pro Ala Ala Cys Ser Pro Ser Tyr Gly Pro Ser
85 90 95

Gln Tyr Tyr Pro Leu Gly Gln Ser Glu Gly Asp Gly Gly Tyr Phe His
100 105 110

Pro Ser Ser Tyr Gly Ala Gln Leu Gly Gly Leu Ser Asp Gly Tyr Gly
115 120 125

Ala Gly Gly Ala Gly Pro Gly Pro Tyr Pro Pro Gln His Pro Pro Tyr
130 135 140

Gly Asn Glu Gln Thr Ala Ser Phe Ala Pro Ala Tyr Ala Asp Leu Leu
145 150 155 160

Ser Glu Asp Lys Glu Thr Pro Cys Pro Ser Glu Pro Asn Thr Pro Thr
165 170 175

Ala Arg Thr Phe Asp Trp Met Lys Val Lys Arg Asn Pro Pro Lys Thr
180 185 190

Ala Lys Val Ser Glu Pro Gly Leu Gly Ser Pro Ser Gly Leu Arg Thr
195 200 205

Asn Phe Thr Thr Arg Gln Leu Thr Glu Leu Glu Lys Glu Phe His Phe
210 215 220

Asn Lys Tyr Leu Ser Arg Ala Arg Arg Val Glu Ile Ala Ala Thr Leu
225 230 235 240

Glu Leu Asn Glu Thr Gln Val Lys Ile Trp Phe Gln Asn Arg Arg Met
245 250 255

Lys Gln Lys Lys Arg Glu Arg Glu Gly Gly Arg Val Pro Pro Ala Pro
260 265 270

Pro Gly Cys Pro Lys Glu Ala Ala Gly Asp Ala Ser Asp Gln Ser Thr
275 280 285

Cys Thr Ser Pro Glu Ala Ser Pro Ser Ser Val Thr Ser
290 295 300

(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1030 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

TGACGCATGG ACTATAATAG GATGAACTCC TTCTTAGAGT ACCCACTCTG TAACCGGGGA 60

CCCAGCGCCT ACAGCGCCCA CAGCGCCCAC AGCGCCCCAA CCTCCTTTCC CCCAAGCTCG 120

GCTCAGGCGG TTGACAGCTA TGCAAGCGAG GGCCGCTACG GTGGGGGGCT GTCCAGCCCT 180

GCGTTTCAGC AGAACTCCGG CTATCCCGCC CAGCAGCCGC CTTCGACCCT GGGGGTGCCC 240

TTCCCCAGCT CCGCGCCCTC GGGGTATGCT CCTGCCGCCT GCAGCCCCAG CTACGGGCCT 300

TCTCAGTACT ACCCTCTGGG TCAATCAGAA GGAGACGGAG GCTATTTTCA TCCCTCGAGC 360

TACGGGGCCC AGCTAGGGGG CTTGTCCGAT GGCTACGGAG CAGGTGGAGC CGGTCCGGGG 420

CCATATCCTC CGCAGCATCC CCCTTATGGG AACGAGCAGA CCGCGAGCTT TGCACCGGCC 480

TATGCTGATC TCCTCTCCGA GGACAAGGAA ACACCCTGCC CTTCAGAACC TAACACCCCC 540

ACGGCCCGGA CCTTCGACTG GATGAAGGTT AAGAGAAACC CACCCAAGAC AGCGAAGGTG 600

TCAGAGCCAG GCCTGGGCTC GCCCAGTGGC CTCCGCACCA ACTTCACCAC AAGGCAGCTG 660

ACAGAACTGG AAAAGGAGTT CCATTTCAAC AAGTACCTGA GCCGGGCCCG GAGGGTGGAG 720

ATTGCCGCCA CCCTGGAGCT CAATGAAACA CAGGTCAAGA TTTGGTTCCA GAACCGACGA 780

ATGAAGCAGA AGAAGCGCGA GCGAGAGGGA GGTCGGGTCC CCCCAGCCCC ACCAGGCTGC 840

CCCAAGGAGG CAGCTGGAGA TGCCTCAGAC CAGTCGACAT GCACCTCCCC GGAAGCCTCA 900

CCCAGCTCTG TCACCTCCTG AACTGAACCT AGCCACCAAT GGGGCTTCCA GGCACTGGAG 960

CGCCCCAGTC CAGCCCTATC CCAGGCTCTC CCAACCCAGG CCTGGCTTCA CTGCCTGGGA 1020

TCTCTAGGCT 1030

(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 304 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS:

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

Met Asp Tyr Asn Arg Met Asn Ser Phe Leu Glu Tyr Pro Leu Cys Asn
1 5 10 15

Arg Gly Pro Ser Ala Tyr Ser Ala His Ser Ala His Ser Ala Pro Thr
20 25 30

Ser Phe Pro Pro Ser Ser Ala Gln Ala Val Asp Ser Tyr Ala Ser Glu
35 40 45

Gly Arg Tyr Gly Gly Gly Leu Ser Ser Pro Ala Phe Gln Gln Asn Ser
50 55 60

Gly Tyr Pro Ala Gln Gln Pro Pro Ser Thr Leu Gly Val Pro Phe Pro
65 70 75 80

Ser Ser Ala Pro Ser Gly Tyr Ala Pro Ala Ala Cys Ser Pro Ser Tyr
85 90 95

Gly Pro Ser Gln Tyr Tyr Pro Leu Gly Gln Ser Glu Gly Asp Gly Gly
100 105 110

Tyr Phe His Pro Ser Ser Tyr Gly Ala Gln Leu Gly Gly Leu Ser Asp
115 120 125

Gly Tyr Gly Ala Gly Gly Ala Gly Pro Gly Pro Tyr Pro Pro Gln His
130 135 140

Pro Pro Tyr Gly Asn Glu Gln Thr Ala Ser Phe Ala Pro Ala Tyr Ala
145 150 155 160

Asp Leu Leu Ser Glu Asp Lys Glu Thr Pro Cys Pro Ser Glu Pro Asn
165 170 175

Thr Pro Thr Ala Arg Thr Phe Asp Trp Met Lys Val Lys Arg Asn Pro
180 185 190

Pro Lys Thr Ala Lys Val Ser Glu Pro Gly Leu Gly Ser Pro Ser Gly
195 200 205

Leu Arg Thr Asn Phe Thr Thr Arg Gln Leu Thr Glu Leu Glu Lys Glu
210 215 220

Phe His Phe Asn Lys Tyr Leu Ser Arg Ala Arg Arg Val Glu Ile Ala
225 230 235 240

Ala Thr Leu Glu Leu Asn Glu Thr Gln Val Lys Ile Trp Phe Gln Asn
245 250 255

Arg Arg Met Lys Gln Lys Lys Arg Glu Arg Glu Gly Gly Arg Val Pro
260 265 270

Pro Ala Pro Pro Gly Cys Pro Lys Glu Ala Ala Gly Asp Ala Ser Asp
275 280 285

Gln Ser Thr Cys Thr Ser Pro Glu Ala Ser Pro Ser Ser Val Thr Ser
290 295 300



(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

GCATGGACTA TAATAGGATG 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 



TCTTGGGTGG GTTTCTCTTA 20



